Reinforcement learning using expectation maximization based guided policy search for stochastic dynamics

- Mallick, Prakash; Chen, Zhiyong; Zamani, Mohsen